NSF PAR Search | NSF Public Access Repository

Privacy and Accuracy-Aware AI/ML Model Deduplication

https://doi.org/10.1145/3725340

Guan, Hong; Yu, Lei; Zhou, Lixi; Xiong, Li; Chowdhury, Kanchan; Xie, Lulu; Xiao, Xusheng; Zou, Jia (June 2025, Proceedings of the ACM on Management of Data)

With the growing adoption of privacy-preserving machine learning algorithms, such as Differentially Private Stochastic Gradient Descent (DP-SGD), training or fine-tuning models on private datasets has become increasingly prevalent. This shift has led to the need for models offering varying privacy guarantees and utility levels to satisfy diverse user requirements. Managing numerous versions of large models introduces significant operational challenges, including increased inference latency, higher resource consumption, and elevated costs. Model deduplication is a technique widely used by model serving and database systems to support high-performance, low-cost inference queries and model diagnosis queries. However, no existing model deduplication work has considered privacy, which leads to unbounded aggregation of privacy costs for certain deduplicated models and to inefficiencies when deduplicating DP-trained models. We formalize the problem of deduplicating DP-trained models for the first time and propose a novel privacy- and accuracy-aware deduplication mechanism to address it. We develop a greedy strategy that selects and assigns base models to target models to minimize storage and privacy costs. When deduplicating a target model, we dynamically schedule accuracy validations and apply the Sparse Vector Technique to reduce the privacy costs associated with private validation data. Compared to baselines, our approach improves the compression ratio by up to 35× for individual models (including large language models and vision transformers). We also observe up to 43× inference speedup due to the reduction of I/O operations.
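The abstract names the Sparse Vector Technique without describing it. As a rough illustration only, the sketch below shows the textbook AboveThreshold variant of that technique, not the authors' mechanism. The function names, the `sensitivity` parameter, and the idea of feeding it a stream of validation scores are assumptions made for this example.

```python
import random


def laplace(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # is distributed as Laplace(0, scale).
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def above_threshold(query_values, threshold, epsilon, sensitivity=1.0, rng=None):
    """Textbook AboveThreshold (a Sparse Vector Technique variant).

    Returns the index of the first query whose noisy value reaches the
    noisy threshold, or None if no query does. The whole scan spends a
    single privacy budget `epsilon`, however many queries fall below
    the threshold, provided each query has the given sensitivity.
    """
    rng = rng or random.Random()
    # Noise is added to the threshold once, up front.
    noisy_threshold = threshold + laplace(2.0 * sensitivity / epsilon, rng)
    for i, value in enumerate(query_values):
        # Fresh noise is added to every query answer.
        if value + laplace(4.0 * sensitivity / epsilon, rng) >= noisy_threshold:
            return i  # halt at the first "above" answer
    return None
```

In a setting like the one the abstract describes, `query_values` could hypothetically hold accuracy drops measured on private validation data, with the scan halting at the first drop that exceeds a tolerance. The appeal of this technique is that below-threshold answers do not consume additional privacy budget.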
Free, publicly-accessible full text available June 17, 2026
Careful About What App Promotion Ads Recommend! Detecting and Explaining Malware Promotion via App Promotion Graph

Ma, Shang; Chen, Chaoran; Yang, Shao; Hou, Shifu; Li, Toby; Xiao, Xusheng; Xie, Tao; Ye, Yanfang (February 2025, The Internet Society)

Free, publicly-accessible full text available February 28, 2026
Careful About What App Promotion Ads Recommend! Detecting and Explaining Malware Promotion via App Promotion Graph

https://doi.org/10.14722/ndss.2025.230051

Ma, Shang; Chen, Chaoran; Yang, Shao; Hou, Shifu; Li, Toby Jia-Jun; Xiao, Xusheng; Xie, Tao; Ye, Yanfang (January 2025, Internet Society)

Full Text Available
Symbolic Prompt Tuning Completes the App Promotion Graph

https://doi.org/10.1007/978-3-031-70381-2_12

Ouyang, Zhongyu; Zhang, Chunhui; Hou, Shifu; Ma, Shang; Chen, Chaoran; Li, Toby; Xiao, Xusheng; Zhang, Chuxu; Ye, Yanfang (November 2024, Springer Nature Switzerland)

Full Text Available
Symbolic Prompt Tuning Completes the App Promotion Graph

Ouyang, Zhongyu; Zhang, Chunhui; Hou, Shifu; Ma, Shang; Chen, Chaoran; Li, Toby; Xiao, Xusheng; Zhang, Chuxu; Ye, Yanfang (September 2024, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD))

Full Text Available
Symbolic Prompt Tuning Completes the App Promotion Graph

Ouyang, Zhongyu; Zhang, Chunhui; Hou, Shifu; Ma, Shang; Chen, Chaoran; Li, Toby; Xiao, Xusheng; Zhang, Chuxu; Ye, Yanfang (August 2024, Springer-Verlag)

Full Text Available
NODLINK: An Online System for Fine-Grained APT Attack Detection and Investigation

https://doi.org/10.14722/ndss.2024.23204

Li, Shaofei; Dong, Feng; Xiao, Xusheng; Wang, Haoyu; Shao, Fei; Chen, Jiedong; Guo, Yao; Chen, Xiangqun; Li, Ding (January 2024, Internet Society)

Advanced Persistent Threat (APT) attacks have plagued modern enterprises, causing significant financial losses. To counter these attacks, researchers propose techniques that capture the complex and stealthy scenarios of APT attacks by using provenance graphs to model system entities and their dependencies. In particular, to accelerate attack detection and reduce financial losses, online provenance-based detection systems that detect and investigate APT attacks under the constraints of timeliness and limited resources are urgently needed. Unfortunately, existing online systems usually sacrifice detection granularity to reduce computational complexity and produce provenance graphs with more than 100,000 nodes, making the detection results hard for security admins to interpret. In this paper, we design and implement NODLINK, the first online detection system that maintains high detection accuracy without sacrificing detection granularity. Our insight is that the APT attack detection process in online provenance-based detection systems can be modeled as a Steiner Tree Problem (STP), which has efficient online approximation algorithms that recover concise attack-related provenance graphs with a theoretically bounded error. To utilize the frameworks of the STP approximation algorithm for APT attack detection, we propose a novel design of in-memory cache, an efficient attack screening method, and a new STP approximation algorithm that is more efficient than the conventional one in APT attack detection while maintaining the same complexity. We evaluate NODLINK in a production environment. The open-world experiment shows that NODLINK outperforms two state-of-the-art (SOTA) online provenance analysis systems by achieving orders of magnitude higher detection and investigation accuracy while having the same or higher throughput.
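To make the Steiner Tree framing concrete, here is a minimal sketch of the classic greedy online heuristic: each newly arriving terminal is attached to the current tree by its shortest path. This is a generic textbook approach, not NODLINK's algorithm. The graph encoding (a dict of weighted adjacency dicts) and both function names are assumptions made for this example.

```python
import heapq


def shortest_path_to_set(graph, source, targets):
    """Dijkstra from `source`, stopping at the first node found in `targets`.

    Returns (cost, path), or (inf, []) if no target is reachable.
    """
    dist = {source: 0.0}
    prev = {}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry
        if u in targets:
            # Walk predecessor links back to the source.
            path = [u]
            while path[-1] != source:
                path.append(prev[path[-1]])
            return d, path[::-1]
        for v, w in graph.get(u, {}).items():
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    return float("inf"), []


def online_greedy_steiner(graph, terminals):
    """Connect terminals one at a time, in arrival order, to the growing tree."""
    tree_nodes, tree_edges, total = set(), set(), 0.0
    for t in terminals:
        if not tree_nodes:
            tree_nodes.add(t)  # the first terminal seeds the tree
            continue
        cost, path = shortest_path_to_set(graph, t, tree_nodes)
        if not path:
            continue  # terminal cannot reach the current tree
        total += cost
        tree_nodes.update(path)
        tree_edges.update(frozenset(e) for e in zip(path, path[1:]))
    return tree_edges, total
```

In a provenance setting, the terminals would hypothetically be suspicious system entities arriving over time, and the resulting tree would be a compact subgraph linking them. Edge weights and how terminals are chosen are left open here.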
Full Text Available
Prompt Learning Unlocked for App Promotion in the Wild

Ouyang, Zhongyu; Hou, Shifu; Ma, Shang; Chen, Chaoran; Zhang, Chunhui; Li, Toby; Xiao, Xusheng; Zhang, Chuxu; Ye, Yanfang (November 2023, Advances in neural information processing systems)

Full Text Available
Wemint: Tainting Sensitive Data Leaks in WeChat Mini-Programs

https://doi.org/10.1109/ASE56229.2023.00151

Meng, Shi; Wang, Liu; Wang, Shenao; Wang, Kailong; Xiao, Xusheng; Bai, Guangdong; Wang, Haoyu (September 2023, IEEE/ACM International Conference on Automated Software Engineering)

Full Text Available
Are we there yet? An Industrial Viewpoint on Provenance-based Endpoint Detection and Response Tools

Dong, Feng; Li, Shaofei; Jiang, Peng; Li, Ding; Wang, Haoyu; Huang, Liangyi; Xiao, Xusheng; Chen, Jiedong; Luo, Xiapu; Guo, Yao; et al (July 2023, ACM)

Full Text Available
